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An algorithm is stated for finding the simple cycles in a digraph 
which is believed to be superior to previous algorithms. The algorithm 
is stated in a way which lends itself to use on a digital computer. 
Suitable modifications are presented which allow the algorithm to be 
applied to coalesced graphs. Finally, the algorithm is compared to a 


previously used technique, and is shown to require fewer Operations. 
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I. INTRODUCTION 


One of the important problems in the theory of graphs, especially in 
its applications to physical systems, is that of locating the simple cycles 
in a directed graph. The author conducted an extensive literature search 
(all American Mathematical Society Reviews, I.E.E.E. Circuit Theory Re- 
views, and American Computing Machinery Reviews), and was unable to find 
any really efficient methods for solving the problem. One of the more 
efficient eae was the algorithm of Danielson [3], which uses matrix 
techniques and is designed for application to undirected graphs, although 
it lends itself to directed graphs if certain modifications are made. In 
comparison with the algorithm which we shall develope, it will be seen 
that it is itself rather inefficient. 

We will first give a number of fundamental concepts, with examples, 
in order to familiarize the reader with the basics involved, and in order 
to make the paper as nearly self contained as possible. In this regard, 
we have adapted considerable material from Busacker and Saaty [2]. 

After a consideration of the basic theory, we develope an algorithm 
for locating the simple cycles in a directed graph. In doing so, we state 
some specific definitions which may not be found in the literature, but 
which are fundamental to our algorithm. After presenting the algorithm, 
we give an example to show how it works, and then we prove that the algori- 
thm does indeed do what it is supposed to do. 

Following the statement of the algorithm, we give modifications which 
allow the algorithm to be applied to aa leseed \eaaiaiig (graphs in which a 
subgraph is represented by a single vertex. We then give an example, 


which is taken from circuit theory, and show how the algorithm is applied. 





II. FUNDAMENTAL CONCEPTS 


Let S be a set. Suppose s is an element of S, and that t is an ele- 
ment of S. By the symbol (s&t) we mean the unordered pair s and t (un- 
ordered in the sense that (s&t) and (t&s) denote the same element). We 
shall use the symbol (S&S) to denote the unordered prodtict of S with it- 
self. Now, let V be any set which is non-empty, and let E be a set dis- 
joint from V. Define a mapping g from E into (V&V). An undirected graph 


is the triple (V,E,g). We denote such a graph by 


op) 
t 


- G(V,E,g), 
Or, simply 


G = G(V,E) 
if the mapping is understood. We call the elements of V vertices, the 


elements of E edges, and g is called the incidence mapping for G. 


PeauEectcdeenapa is the triple 


G = G(V,E,g'), 
where V and E are defined as above and where g' maps E into (VxV), the 
unusual Cartesian product of V with itself. 

We use the terminology of Harary [4] and call a directed graph a di- 
graph. As in the undirected case, we call the elements of V vertices and 
the elements of E edges. We shall denote both elements of V and elements 
of E by subscripted lower case letters. If g' is one-to-one, we shall 
sometimes denote an element e of E by the pair (v) sv) of vertices for which 
g'(e)= (Visvo): Note that to each digraph there corresponds an undirected 
graph formed from the digraph by removing the direction from all edges. 

A « aph 


H = H(V,» EL 81) 





is said to be a subgraph of a graph G = G(V,E,g) if and only if the following 
conditions hold: 
Gi) VE, 
Gm) = E, 
(iii) g, = 8 ee 
where g 4 denotes the restriction of g to the subset or one Ve 
If e is an edge in a graph G = G(V,E,g), and if 
g(e) = (véw) 
(or (v,w) or (w,v) in the case of a digraph), then e is said to be in- 
cident to v and w. If the graph is directed, then an edge e such that 
g(e) = (v,w) 
is said to be incident into w and incident out of v. Alternately, v is 
referred to as the initial vertex of e and w is referred to as the final 
vertex of e. A vertex v in a digraph whose incident edges are incident 
out of v is called a source; a vertex w whose incident edges are incident 
into w is called a sink. 
Geometrically, we represent an undirected graph by associating each 
vertex with a geometrical point and each edge with a curve segment. A 
digraph is represented in the same way, except that the curve segment is 


directed according to the order of the vertices corresponding to the edge. 


Figure l(a) shows a geometrical representation of an undirected graph, 





' 
while Figure 1(b) shows a digraph. ei 
Va 
Vi V3 
2 
mV 
4 
Ve Cs ’ 
Figure l(a) an undirected graph (b) a digraph 





For the graph in Figure l(a), 


va fv ssvps¥ge¥qe¥sh : 

E = Fey 21830842586} ; 

Bie, - (vo&v,), a - (v,6v,), en - (v/&v—), = (v,&v_), 

eg - (v,&v5); 2 (v,&v3). 
For the graph in 1(b), 
vi = { visvyavyovsovs f ; 
E' fej.e;,eq,e, .edseg) : 
' 


; . eet 1 tof ‘a tof mY. tof 
&: oA (Vj 9V,)> =) (Vso) > 3 (Vo5V3)5 = (v3 5V)) 


to thant —s tent 
e, (V4 5Vo)> e6 (vo .V,)- 
In the sequel, we shall consider only digraphs. 

In a digraph, two edges are said to be consecutive provided the 


initial vertex of one is the final vertex of the other. Thus, two edges 


ey and eo such that 


g(e,) = (a,b) and g(e,) = (b,c) 
are consecutive edges. A sequence of distinct consecutive edges of the 
form 
P = (€) ,€52€g5+++@) 4) 

such that 

g(e,) = (v),v5), Ble,) = (Vosv,), B(e3) = (vg,V,)5-++,8€e, 5) = 

eet” etre” ” SY Bua 
is called a path. Note that no edges are repeated in a path, however, 
vertices may be repeated. A subsequence P' of consecutive edges is 
called a subpath of the path P. If P' starts with the same edge e, as 
P, we say P' is an initial subpath of P. We shall say a vertex Vv. is 


attainable from a vertex NS provided there is a path from 2 to Vi- 





Alternately, we say that 5 is a descendant of ve and that v_ is a pre- 
O 


decessor of v.. 
i 
We define a function d on the set © of all paths in a graph G in- 


to the set of natural numbers (including zero) by 


de) = n, Pin G@ . 
where n is the number of edges in the path P. d(P) is referred to as 
the length of the path P. Note that an edge is a path of length one. 
Let G = G(V,E,g) be a digraph. Define a multi-valued function f 
on V into V according to the following rule: 
f(v) = kar v' in V and there exists e in E such that g(e) = 
(viv) , for all v in V. We call f the descendent function of the 


vertex v. We define powers of f as follows: 


£4 (v) = or ey slg) EE So lye' an Vain We 
whe re ie is the set of all descendants of v by a path of length j. For 
example, in Figure 1(b), we have 
cop = {op sh. hop = Lopvpwg} 

A special type of path in a digraph is one which starts at a vertex 
6 and ends at a vertex “> where Wie Such a path is called a cycle. 
If no vertex (except vo? is used more than once in a cycle, then the 
cycle is said to be simple. A simple cycle of length one is called a 
loop. A graph which contains no cycles is referred to as acyclic. 

Suppose G = G(V,E,g) is an acyclic graph in which all vertices are 
attainable from a single vertex oe in V. Further, suppose no vertex has 
more than one edge incident into it. Then G is called a rooted ditree. 


ae is called the root of G. Figure 2 below shows a rooted ditree 





Qo 





A; § O, 


Qe Ag Gy. (Oy 
Figure 2. A rooted ditree. 


Note that a rooted ditree must have only one root. A vertex in a rooted 
ditree is said to be terminal if it has no descendants. Thus, in the 


graph of Figure 2, a, is a root and a are terminal 


0 g? 797 9429197711°77 
vertices. A branch in a rooted ditree is any path whose initial vertex 
is the root and whose final vertex is terminal. Hence, the sequence of 
edges 

B= ( (a)585)+ (85585) (855249) ) 
(note that since there are no multiple edges between any pair of vertices, 
there is no confusion if we represent an edge by its associated pair of 
vertices) is a branch of the ditree in Figure 2. 
A graph G = G(V,E,g) whose vertex set V can be partitioned into two 
non-null disjoint sets V, and V, such that E consists only of edges e, 


1 2 
such that 


ge.) = SEY or (Vivi) De in Vi» v; in Vo» 


is called a bipatite directed graph or bi-digraph. The graph in Figure 


3 is a bi-digraph. 


Note that 


V = {8125125982858 2) 985 04 


fa, 24185220} ) 2798498586 ™ v Uv... 


10 





| a - 


(0) Cis , 
a. a Q 6 
As “p ry 
Figure 3. A bi-digraph 


We have the following result concerning bi-digraphs. 
THEOREM I: If G is a biedigraph, then all simple cycles in G are of 
even length. 


Proof: Supj:ose C is a simple cycle in G of the form 
C= (2) 5s veey@ 122 )5 


where d(C) is odd. If the length of C is odd, then the initial 
vertex of C must be different from the final vertex by the definition 
of bi-partite graph. But this is impossible since the initial and final 


vertex of a cycle must coincide. Therefore, d(C) is even. 


11 





III. AN ALGORITHM FOR FINDING THE SIMPLE CYCLES IN A DIGRAPH 


An important problem, especially in the application of graph techni- 
ques, is that of locating simple cycles in a digraph. In dealing with 
large graphs (large in the sense that V and E contain many elements), 
it is useful to have an efficient algorithm which conserves memory space 
of a computer. The algorithm we shall develope in this chapter is de- 
signed to meet these requirements. In the final chapter of the paper, we 
will show how our algorithm compares with an algorithm given by Denielson 
iB: 

Before starting, it will be necessary to introduce some new terminol- 
ogy. In the sequel, we shall denote the vertices of all graphs under 
consideration by intecer-indexed lower case letters such that no two 
vertices have the same index. If j is an integer such that v; occurs in 
a graph under consideration, we will say v, is the vertex corresponding 


to the indexing integer j. 


DEFINITION 1. 

Let G = G(V,E,g) be a digraph. Let v. be an element of V. Then, 
the indexing integer j corresponding to “E is said to be an adjacent 
element of A provided U4 belongs to f(v,), where f is the descendent 
function of Vi: 


For example, in the graph in Figure 3 above, the adjacent elements 


of Vv, are the integers 6 and 9. 


DEFINITION 2. 

If G = G(V,E,g) is a digraph, and if v. is a vertex in V, then the 
adjacency set of v5 is the set of all adjacent elements of ie (e.g., the 
adjacency set of V5 is the set of index-integers corresponding to the 


elements of f(v,)). 12 





For example, in the graph in Figure 3, the adjacency set of a. is 
the set {6,91 . If no vertices are adjacent to a vertex (i.e., if a ver- 
tex is a sink) we denote its adjacency set by @ (the null set). 

In the discussion which follows, we assume that the graph under con- 
sideration, G = G(V,E,g) is a digraph on n vertices which are represented 
by the lower case letter v indexed l through n. We further assume that 
in order to communicate to the computer the structure of G, we have a 


list which associates each vertex in V with its adjacency set. 


DEFINITION 3. 

An attainability chain of a vertex 7 in a digraph G is a directed 
graph C., which corresponds to a simple path AT ae in G. The source 
vertex of C. is i (corresponding to Vis and the sink vertex is either a 
sink vertex in G, or a copy k' of the first repeating vertex v, in the 
path. 

For example, consider the graph in Figure 3. The vertex ay has the 
following attainability chains: 


1 1 





Figure 4. Attainability chains for the vertex a, of the graph in Figure 3. 


i 
Note that if a simple cycle occurs, as in the cycle composed of the ver- 
tices Ay» Ags Ap, Ans ays then the attainability chain of any of these 
vertices terminates as soon as the first repetition of a vertex occurs. 


Also, note that each attainability chain is a rooted ditree with one 


branch. 


13 





DEFINITION 4. 

ome vacttaanapility tree for a vertex Vv. is a rooted ditree with a 
vertex corresponding to i in the root position and with branches con- 
sisting of all the attainability chains of Vas We will super-impose the 
initial subpaths of any attainability chains with identical initial sub- 
paths. 

The graph in Figure 5 shows the attainability tree for the vertex a 


l 
of Figure 3. 





g 


Figure 5. An Attainability Tree 
We now give the formal statement of the algorithm, keeping in mind 
that the graph under consideration is a digraph G = G(V,E,g) on n ver- 
tices, and that we have a list 5: vertices and their corresponding 


attainability sets. 


STEP 1. Inspect the list and eliminate all vertices whose index 
integers do not appear as adjacent elements. Obviously, these 
vertices could not belong to simple cycles because they have no 
edges incident into them (i.e., they are sources). Also, inspect 
the list and eliminate as roots all vertices whose adjacency set is 
G@. Again, it is clear that such vertices car:inot belong to any 
simple cycles because there are no edges incident out of them 


(i.e., they are sinks). We have thus eliminated all sources and 


14 





sinks as possible roots. We may now repeat the procedure on the 


remaining vertices until no more eliminations are possible. 


STEP 2. Pick any vertex V5 from the 1l°st which has not been eli- 
minated and construct its attainability tree, consulting the 


origional list for incidence relations. Recall that we super- 


impose identical initial subpaths. 


STEP 3. If an element is repeated along any attainability chain, 
record that element together with the elements between the two 
occurrence (jn the order they appear). The vertices corresponding 
to these elements will compose a simple cycle in G. Note that 
repeated vertices must occur along the same branch of the at- 


tainability tree for a cycle to be located. 


STEP 4. Eliminate as possible roots all elements which appeared in 


the attainability tree of v for if these vertices belonged to 


5 iad 
simple cycles, this would have been discovered in the attainability 


tree of Vy° 


STEP 5. If all vertices have been eliminated as possible roots, 
the algorithm terminates. If some vertices remain, choose any such 
vertex va not eliminated as a possible root, construct its attain- 
ability tree and return to step 3. 

We now ask the following question: Does the algorithm locate all 


the simple cycles in a digraph (and only the simple cycles)? We answer 


this question with the following theorem: 


15 





THEOREM II. The algorithm outlined in Steps 1 through 5 above locates all 
of the simple cycles of a digraph, and it locates only simple cycles. 

Proof: First, note that no cycles will be located which are not 
simple, since a branch is terminated at the first repetition of a vertex. 

Now, suppose there is a simple cycle C in the digraph G which was 
not located by the algorithm. Then C can have no edges in common with 
any attainability tree considered in the sequence, for if it did, then 
by the fact that a branch is terminated only if a source, a sink, or a 
repetition occurs, C would have been located. Conversely, only those 
edges of G which involve sources or sinks in G are left out of the set 
of attainability trees. Thus, we have established that if C is not 
located by the algorithm, then C does not belong to G, which is a contra- 
diction. Therefore, C must have been located by the algorithm. 

It should be noted at this point that some cycles may be located 
more than once. However, this will occur only in special cases. 

As an example of the application of this algorithm, consider the 


digraph shown in Figure 6 below. 


Vs 
Vv | 
Vie 
Vie 
Va ww 
Vs Vq Vig 
Va Muy Vig 
’ Vv g 


Figure 6. A Digraph 


16 





We have the following list of vertices with their adjacency sets: 


1 fa} 
ape 
vy? {arf 
fu} 
ve: {10, ui} 
ae, 
v5: f13, wh 
ve? {ust 
yt as 
“10° {6} 
vit fT» 8 
Vint $5. 9 
vig: £95 


g 


Vv 


jig 


Applying Step 1, we eliminate Vig as a possible root, since its adjacency 


ih 


set is $. We also eliminate V2 Vor Vao anduyv-,.esince lk. 2.3. cama 4 


3 a 


do not occur in any of the adjacency sets. Let us now construct the 


attainability tree of v 


10° 10 
G 
12 
< 
Qq 
* i » \* 
CYCLE a SINK 
12 Q “~~ 8 
14. 13 
q SINK q 
4 Picueet 7: 14 
SINK 


SIN K 
Attainability tree for Yi0 of Figure 6 
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We have located one simple cycle, namely 


C= (199%) MGM 12)» %179%5) 59% 10) ) 


Note that 5, 6, 7, 8, 9, 10, 11, 12, 13 all appeared in the attainability 
tree of Vio? and that l, 2, 3, 4, and 14 have already been eliminated 
as roots. Hence, by Step 4, we need construct no more attainability 
trees, and we can conclude that C is the only simple cycle in the graph. 
An inspection of Figure 6 also leads to this conclusion. 

Note that if the graph under consideration were a bipartite digraph 


(or bi-digraph), we would have had two lists instead of one. In this 


case, we would add the following step the simplifications above: 


STEP 0: Eliminate as possible roots all vertices which occur in the 
list with the most vertices. This will be sufficient to locate all 
simple cycles by Theorem I which states that all cycles in a bi- 
digrap! have even length, and hence must contain at least one vertex 
from each list. This simplification will be used in the next 


chapter. 


We observe at this point, that though the primary purpose of our 
algorithm as stated is to locate simple cycles, it may also be used to 
locate any simple path in a digraph. For example, if we wish to know 
if there is a simple path between a vert*x Yo and a vertex v. in a di- 
graph, we need only construct the attainability tree of Vo and note 


whether it contains Vee 


18 





IV. COALESCED GRAPHS 


The purpose of the foregoing algorithm is to make possible the loca- 
tion of all simple cycles in a digraph with a large number of vertices. 
In practice, in the analysis of very large graphs, such as the graph of 
the circuitry of a digital computer, a procedure is used which partitions 
the graph into subgraphs which are, in turn, partitioned into smaller sub- 
graphs, until manageable subgraphs are obtained. Once these subgraphs 
have been analyzed for cyclic structure, they are generally coalesced 
into a single vertex, and the resulting graph is then analyzed for its 
cyclic structure. In this chapter, we will show how the technique of 
Chapter II may be applied to such "coalesced graphs." 

One of the principal problems in locating the ite cycles in a 
coalesced graph is that a vertex which is a coalesced subgraph may be 
attainable from a vertex v, but a vertex v* which is attainable from 
the coalesced subgraph may not be attainable from v in the original 


graph. For example, suppose G is the graph shown in Figure 7. 


V 





b C 


Figure 7. A Digraph 
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Suppose that, by some procedure, we determine that the subgraph of G 
composed of the vertices a, b, c, d, e, and £, together with the edges 
connecting them, can be coalesced into a single vertex oe Then, the 


graph shown in Figure 8 is the coalesced version of the graph in Figure 


io 





Figure 8. The Coalesced Version of the Graph in Figure /7 

Note that this graph is bipartite, since one node actually represents a 
subgraph, while the others represent single vertices in the Original 
graph. Also, note that in the graph in Figure 8, v* is a descendant of 
v but, in the original graph (Figure 7), it is not. Thus, we see that 
our algorithm cannot be applied to coalesced graphs without some modi- 
fication. 

Let Go = GV E98) be a digraph without coalesced vertices. 
Let S be a subgraph of Go: Form the graph G. = GV EB) in which 
S has been coalesced into a single vertex.s. Let E be the descendent 
function on the vertices of Go» and let f. be the descendent function on 
the vertices of Gos Define subsets I(S) and J(S) on the vertex set - 


of CG. as follows: 


fy, in ue: s is in cv 4 


fy, in ie Me is in £(s) 


I(S) 


J(S) 
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Now, define a mapping p on I(S) x J(S) into 0,1 such that 


O if v, is not in E(v,) for all k, 
J l if there exists k such that vs is in fi ov,)- 


Compute PAV ian ; ) for all pairs es with v, in I(S) and Vg ir. JCS) 


i 


as follows: Let S. be in S such that Ss. is also in Ep: Let S be 


such that we is in o Bede Note that s, and s, must exist since [ is 


i j 
in I(S) and ue ispan J¢S). Wee T. be the attainability tree of Sy 
i 
and suppose We is the vertex set of T. »- Then, if is in Ve r 
i i i 


otherwise, 
gp ee = 0. 
Note that Ti may not exist, in which case ee? is obviously zero. 
i 
When ig ee has been computed for all ordered pairs me.) with 
v, in I(S) and ee in J(S), we apply the algorithm of Chapter II to the 
bi-digraph Go- Suppose we do this and obtain simple cycles Ci» See ueuins ci 


Note that by Theorem I, the length of C, (i = 1, ... , n) will be even. 


Hence, if m, is the number of vertices in Ci then m,/2 is the number of 


d. 
coalesced subgraphs in C.. Denote these coalesced subgraphs by Ke ki: 
ee¢@ k e 
> ¢ 9) 
im, / 
Let 
Take = \v. in ies : Soy is in swt ~ laa. ny i] = UL, ++ e ym, /2 
Jk; 5) = vy in ve ov) is in fk, ,)} wee wes Gems 6} = vee gm,/2 


For each ae Celie awn. | = -lyssacq m,/2) form the ordered pair 


), where fol is in and t, 


jt is in 28 ey, and both ae 


Sel yt 1 


and t, are in C,. Now, the ordered pair (t 


j+l ‘ tay must be in the 


i ae 
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domain of p, since eae is in I(k; x JU, 4): Note the value of 


p(t, pti): Tf if is zero for amy j (j =~ 1l, ... ; m,/2), then c. is 


not actually a cycle in Go. Tf it@¢emone for all j, then C. is indeed a 


cycle in Go: For example, suppose CG, is the graph in Figure 9 below. 


V V4.9 Va9 Vez V4\ Ve? V63 V6 
\ mC : rel PQ ~C _ _. 





Figure 9. A Digraph 
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This is a digraph of an adder from a digital computer 


Suppose we have 
a procedure which partitions G 


, into the four subgraphs shown in Figure 
10 Ve2 






Vae Vays 

















Vay Ve 
V3 
os 
: V3 Ver 
Ve 
-" SUBGRAPH A 
rr N V&% 
a a. ea a 
V je r 
Vg ei. aa . = “ ; 
anal : =: . @ Vel 
\q 4 Vs ge ea. 
“ ae a he. Ve3 
 _ : SE 7? oa 
V31 —sudzantn ot “ts — ee — 
—S bs 
ce ae 
Ves a ~~ Vv72 
Vit @ Vv 
v 3 6 
‘"e— YO SUG GRAPH GO 
Ye V3.5 
Vq oY 
“We v1 
Mt@— e 
Veo Vio .— a 
V3 O-——.., pe V2.6 e 
Vit auaeceaPn DO, V44 


Figure 10. Four Subgraphs of the Graph in Figure 9 
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We now apply our algorithm to the four subgraphs. 


lists of vertices together with their adjacency sets: 


Subgraph A 


fd 


: {21} 
fork 
F225 
ie: 
» fosf 


{235 
{20} 


: {29} 
a {30} 
: 135f 


£70} 
£5. 
ssl 
€5 
ha 
- 75} 


g 
et 
: fark 
: feat 

g 


Subgraph B 
v, + S27} 
ae 
“+ fal 


ae 
‘on 
v2 $36.37 
: {325 
: $32,33} 
: {334 
: {33 
: faag 
: {41,42 
: $41,435 
: f41,44f 
v5 450¢ 
: $50f 


{50} 


Ny ase $51} 


357 f 
is7 § 


: 34,584 
: {62h 
: 61,63 


%61, "62, 63° 


g 


We have the following 


Subgraph C Subgraph D 
Visi U93,54,553 vy {258 
Y16: 453,54} Vy 3 fos} 
v7! $66, 68f Yaa! {79} 
Vig: 1543 V49: 4798 
Cb 1554 ees fgot 
Trae £565 Te sof 
vet 159,653 Vogt 38,39 
ie: {60} ae f263 
Tea +608 YE: $266 
Tye S60$ Vac! {765 
ae $66,674 Vag! (76,775 
Veg: 1685 Vag 1784 
Yeo! 1 66,67, 68,69} V06 {314 
ge: (67,69 V3! C405 
Ve,? 168,69$ Yio! {775 
We $754 Jae Fre f 
ven? L755 Vag: $45,465 
Ven: {714 voy: {45,475 
Yast 372% = £45,48f 
Vig: 9g Vig! $526 
va: V,93 S525 
vee @ ae: ¥ 524 
Vag: 178 
ven: §49f 
Yn5° g 












We have the following attainability trees for the subgraphs A through D: 

















1\ a2 2.3 ) 3 | ait f 
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a 13 nD “GHD 
1¢ 
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@2t 24 3S 49 485 
X28 | 
, 7 ) 3 
4\ 37 42. =” 34,033 0. 657 
57 S| End FEO Kb) D p 
“ n ey: - boi wot 
a2 ¢ yr 4 GYCLE (S0,s1 34,44) 
END Eh 0 62 ey ot & - —- 
< Eno “ry "nO cyeua END 
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Figure ll. Attainability Trees for the Subgraphs of Figure 10. 
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We have located two simple cycles: 


C1 = 552% 549% 342% 44 9%50) 
C= (7g 5448945924499" 73) 


We now coalesce the four subgraphs into vertices and examine the re- 
sulting bi-digraph for cyclic structure in order to discover if there are 


any larger simple cycles in G.. Figure 12 shows the coalesced graph G . 
yo %4@ @Ys 5 Ver 








“VS 4 


Vt > Ny 
» V4\ 
" Vie * Vag 
<M 
y Novit 
1 Ft. Vv : 
14 


¥ 
Figure 12. The Coalesced Graph of the Graph in Figure 9. 


Since this is a bipartite graph, we have two lists of vertices: 


a 2 

A: F 35,64,754 y yal 
B 41,62, 61,634 v, : Fah 
c: §75,71,72} v, + £At 


: day 
{a} 
: fal 
Ae 


< 


D: 440,45 } 


< 
“ OD UW & WP 


<< 
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Vv *s: : i 
Vis? fot 63° Ca,c) 
Sneee te v, + {B,Df 
17? fc Yn0° {ph 
Vig? {ct Vg ? {3 ,D4 
Vi9? {ch Vi: TB, Dé 
hey e Vans? {33 
Y 


Vai: 

Y60? Fay 
Y61° was 
Vac: OB,DS 
Vos: ) 


We now invoke step O and consider as possible roots only those vertices 


which appear in list V We construct the attainability tree for vertex 
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Figure 13. Attainability Tree for Vertex A of Go: 
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We see that vertices B, C, and D, appear in the attainability tree of A, 
so it is unnecessary to construct any more attainability trees (by Step 
4). 


We have located the following simple cycles in Go: 


C, =(A5V,-;B,Vea94) 
C, =(A5V469B,V¢ 104) 
C., =(A,V362B,Ve3 24) 
C, =(A,V355B,Ve4,C,V7) 2A) 
C, =(A,V565B,Ve41CrV7 5.4) 


ee 35g 
C, =(A,V459D1V) 9B V6 A) 
C, =(A,v,-,D,v 


Cy =(A5V365D9V)99B,V63CsV2) 04) 


=(A,v A) 


35°P?%49902%63 209% 70» 
C), (AsV, 5Dsv,5 2B. V e924) 

Ci5 =(A,Vv45 D5, 52B»V¢ 5 2A). 
C1, 4A,v,,5D,v,55BsV¢5 5A) 

Cy, =(A,V,.5D,V, 52B1Ve,sCsV2, 5A) 
Bas (A,V, 5 5D5V,.55BsV63sCsV 795 
C16 AAs Gy 2Cs¥7 1 9A) 


A) 


c.. “~A,v A) 


17 eae”? 722 


We must now compute the values of p to see which of these simple cycles 


in G. is also a simple cycle in eos We shall do this only for C The 


i ing 
others would be done exactly the same. 


We first note that C, involves the coalesced vertices A and B, with 


in fA); B in fv 35)? and v in f (8). Hence, 


5) 


A in Een yee Vv 4 62 
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we must compute PCV, 2V 35) andp(v ). To compute p(v 


35°62 ao°%357" “° 
examine subgraph A, and note that 62 does not appear in any of the at- 


tainability trees of the vertices of A. Hence, we must construct the 


e e e td e dai ° 
attainability tree of Yeo to see if Va5 is a descendant of Yeo 


GZ 
8 | 


i 6¢ 








B 64 


74 & » 14 
75% e975 
Figure 14. The Attainability Tree of Ve9 
We see that 35 does not appear in the attainability tree of Ve,and thus, 


P(V699V35) = 0. Therefore, C, is not a cycle in Gi: We will not repeat 


I 


the procedure for C, through C however, if we did, we would find that 


2 17’ 


none of these are cycles in Go: 

In actual applications of this technique, we would first determine 
the cyclic structure of each subgraph which is coalesced into a node. 
In addition to retaining the resulting information about cycles, we 
would also note POV V4) for each pair of nodes external to the coalesced 
subgraphs. We would then use this information in constructing the attain- 
ability trees for the coalesced graph. Had we done this above, the at- 
tainability tree in Figure 13 would have been less complicated. We did 
not folloyv this procedure above in order to better illustrate the use of 


the function p. 
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V. A COMPARISON WITH ANOTHER ALGOTHITHM 


We shall now show how the algorithm developed in Chapter II for 
locating the simple cycles of a digraph is much superior, in terms of 
operations required, to another algorithm used for the same purpose. 

Danielson [3] developed an algorithm for locating all paths (in- 
cluding cycles) in an undirected graph using matrix techniques. His 
method, while designed for undirected graphs, can easily be modified 
for application to digraphs. We now state the necessary definitions to 


explain his algorithm. 


- 


DEFINITION 5. 


If G = G(WV,E,g) is an undirected graph on n vertices (v,> ae ve) 


and m edges (e,se,e ), then the adjacency matrix A, of G is given by 


A, = a? where 


1 if an edge connects v, to ee 
ao = 


A) O otherwise. 


DEFINITION 6. 


If G = G(V,E,g) is an undirected graph on n vertices (V)> stolee a vey) 


and m edges (ey; See e then the variable adjacency matrix A of G 


is given by A = (a, 5)» whe re 


Qe. if edge Qe. joins v. to Vie 


O otherwis.. 


So = 
DEFINITION 7. 
If G = G(V,E,g) is an undirected graph on n vertices (vy> eee Vv)» 


then the modified variable adjacency matrix B of G is given by B = (b, 4): 


where 
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v, if an edge joins v, tov., 
. , j ge J 5 j 
ry QO otherwise 
To illustrate these concepts, consider the graph of Figure 15. 
Vi e, Ve 
“ 7 | | ait tate . ——3 6 
e 
Zz €2 
Vs e 
€4 
V3 es 4 
Figure 15. An Undirected Graph 
We have 
0 1 0 0 1 0 e) 0 60 en Vo 0 0O V5 
eeOurOr 1) ey 0 0 Q\g & 0 60 Vv, Ve 
A, = GywO0 CO 1 JIPA= Wo 0 O Qe, yt; B = Vy, V3 Cre 
0 11 0 0 0 ae e, 0 600 Vy V3 0 0 
| a) a ee 6 ae 0 en ey ey 0 60 Vv, V3 0 0 





The algorithm states that all edge progressions of length k between 


v, and a will appear in the i-j position of P 


i , where Pe is defined re- 


k 


crusively as follows: 
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Hence, for the graph in Figure 1L5, 


Vv oVotV.V, ve, Vv fa oe wf a't Be , 
Vote ¥¥ 5V.Y, V4%e vv rua 5V3 ED lee "3 
VVotVe¥5 te 0 v9, 7 6¥3 Xe? "3 
2 ue Menge! 35 ee ae 0 VV, 

Vi +VoV 
Vov giv “ oF a F BB ee y Fo t, Dh VON" 


We first note that, as stated before, the algorithm only locates 
the paths and cycles in an undirected graph. However, it is possible to 
modify the procedure slightly to obtain the cycles of a directed graph. 
How this is done is unimportant; what is important is that to locate all 
paths (including cycles) of lengths, k, k-l, ... , 2 in a digraph, we 
are required to multiply n x n matrices k times. In particular, if we 
were seeking all simple cycles in a complicated graph on n vertices, 
we would have no choice but to perform matrix multiplication n times. 
Now, in using this algorithm, we are not performing matrix multiplication 


in the usual manner, since the elements of the P, 's and of B are symbols 


k 


(i.e., subscripted lower case letters) rather than numbers. Hence, what 
we are really doing is concatenating symbols in order to obtain the ele- 


ments of the P's. Note that the symbol + is included in this concaten- 


ation each time strings of symbols are added together. For example, the 


element of P. above which appears in the second row and fourth column 


(vVjVoW.VotV<V3) is the concatenation of eight symbols. We interpret 


this string of symbols as follows: there are three paths of length three 


from vertex v, to vertex Vays namely via vertices Vy and V5 or via vertices 


Vv. and Vo or villa Vs and V3° We observe that the basic operation of 


5 
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Danielson's algorithm is the writing down of a single symbol. That is, 
in writing the element mentioned above, eight basic operations were in- 
volved. This process takes up more time and memory space than that re- 
quired to perform a comparison of two integers which is the basic operation 
of our algorithm. 

As was mentioned earlier, we are required to perform n nxn matrix 
multiplications in order to analyze an arbitrary graph for cyclic struc- 

: ' . 3 
ture using Danielson’s algorithm. Hence, we must generate n elements, 
each of which may be a long string of symbols. Though some of these ele- 
ments may be zero, at worst, as in the case where each pair of vertices 
is joined by an edge and a loop occurs at each vertex, all elements would 
be non-zero. Now, in the event we were to analyze such a graph, each 
@ 2 kel 

element of PL would contain n- terms of (k-l)n symbols for the nodes 

k-l 


and n - 1 plus (+) signs. Altogether, each of the ne terms of Pe 


would require 
Gin ae -1 <a wen 
symbols. Hence, if the basic operation is that of writing down a single 
symbol, then the number of operations required to locate all paths (and 
thus, cycles) in the worst case is given by 
b= ek? =a). 
k=2 

To obtain an upper bound for the number of basic operations (i.e., 
comparison of integers) using our algorithm, we consider a special type 
of graph, namely the complete symmetric graph on n vertices. A digraph 


G = G(V,E,g) is said to be complete if for every pair of vertices Te and 


in V, there exists an edge e in E such that either g(e) = eae or 
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g(e) = mes): A digraph G = G(V,E,g) is said to be symmetric if 
for every edge e such that g(e) = Se) for v. and ve in V, there 
exists an edge e' such that g(e") = (viav,)- Thus, a complete sym- 


metric graph on 4 vertices is shown in Figure 16 below: 





Figure 16. A Complete Symmetric Graph on 4 Vertices 

We will compute the number of comparisons . necessary to analyze this 
graph for cycles. Clearly this will’be an upper bound on the number of 
comparisons required to analyze any digraph without parallel edges, since 
any other digraph with these properties will be a subgraph of the complete 
symmetric graph on the same number of vertices. 

Note that the adjacency set of each vertex in a complete symmetric 
graph will contain n elements. Hence, at the first stage of the attain- 
ability tree (i.e., at the stage involving all vertices attainable by a 
path of length one from the root vertex), we have n elements which must 
be compared with one eceman tC semaame ly the root vertex. At this stage, 
we will locate one simple cycle: the loop which occurs at the root vertex. 
Thus, we continue on to the second stage with n(n-l) vertices. But, at 


this second stage, we will locate all of the other n-1 loops and all of 
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the simple cycles of length two which involve the root vertex. There 

are n=l of these. Therefore, we continue on to the third stage with 
(n(n-1) - (n-1) - (n-1)) n = (n-l) (n-2)n vertices. Now, at this stage, 
we locate (n-1l) (n-2) loops (obviously some are duplications), (n-1) (n-2) 
cycles of length two, and (n-l) (n-2) cycles of length three. Thus, we 


continue on to the fourth stage with 


eee) (n= 2) = (nema) = (n=l) (n=2))n 
= (n(n-1)(n-2) - 3(n-1) (n-2))n = (n-1) (n-2) (n-3)n 

vertices. At the kth stage, we will have (n-1l)(n-2) ... (n-k+tl)n vertices. 
Now, each of these must be compared with k vertices. Hence, at the kth 
stage, there are kn(n-1)(n-2) ... (n-k+l) comparisons to make. We demon- 
strate this more formally in the following theorem. 
THEOREM III. For a complete symmetric digraph (with loops at each vertex) 
on n vertices, there will be (n-1)(n-2) ... (n-k+tl)n vertices at the kth 
Stage of the attainability tree. 

Proof: The proof is by induction on k. Suppose k is 1. Then the 
number of vertices is n, which is clearly the case, since in the complete 
Symmetric graph on n vertices, each element has n terms in its adjacency 
set. 

Now, suppose the conclusion holds for k = p. That is, suppose that 
at the pth stage, there are (n-1)(n-2) ...(meptl)n vertices. At this stage, 
we will locate (n-1)(n-2) ... (n-ptl) loops, a like number of cycles of 


lengths 2, 3, ... , p. Hence, at the pt! st stage, we will have 


((n(n-1) (n-2) ... (n=-ptl) - p(n-1)(n-2) ... (n-ptl))n 
= n(n-1) (n-2) ... (n-ptl) (n-p) 


vertices. Thus, since p was arbitrary, the conclusion holds in general. 
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We see, then, that an upper bound on the number of comparisons re- 


quired to analyze an n-vertex digraph is given by 


r"\ 
C= os kn(n-1)!/(n-k)! 
k=1 


Now, since a is clearly greater than (n-l)!/(n-k)! for k greater than 
one, C will be less than D (for n 23). In fact, as n becomes large, 

the difference between D and C becomes very great indeed. Also, since 

the complete symmetric graph occurs rarely in practice the number of 
comparisons will generally be very much less than the upper bound C (e.g., 
the graph in Figure 6 which required 79 comparisons as opposed to several 
million for the complete symmetric graph (with loops) on 14 vertices). 


However, since the non-zero elements of P, will generally disappear 


k 


quickly from P, as k increases, the number of basic operations required 


k 
to carry out Danielson's algorithm will often approach D. To illustrate, 
we will compare the number of basic operations required to analyze the 
graph in Figure 17(a) using Danielson's algorithm with the number of 


basic operations required to analyze the graph in 17(b) using our algori- 


hm. é 
V3 Vi 





Vi Ve 
Ve! Vs 
Figure 17 (a) An Undirected Graph (b) A Digraph 
Using our algorithm on Figure 17(b), we obtain the attainability tree 


shown in Figure 18 (the only one for this graph, since all vertices 


appear in it). 
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Figure 18. The Attainability Tree for the Graph of Figure 17(b) 
We see that this graph requires 22 comparisons of integers. 


Using Danielson's algorithm on the graph in Figure 17(a), we have 


ae 3 BES 
Pye, «= YS, 
V> v, Vv, 
and 
“75, VV, PEW GIVY, VV, W5V) F375 
P, =14%"7, 73°77, V{V3V,7,  VyVytV3V,4¥5¥, | ° 
ViVotV,V,4V,¥3 V,V3+¥5V, VV; v,v,*¥5Y, 


We see that this required /8 basic operations. 
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VI. CONCLUSIONS 


We have developed an algorithm for locating the simple cycles of a 
digraph which appears to be superior to previous algorithms in terms of 
the number of basic operations required and which is conservative in its 
use of memory space on the digital computer. In particular, we have analyzed 
our algorithm and the one originated by Danielson [3], and found that the 
upper bound for the number of basic operations using Danielson's algorithm 


was given by 


n 
Ve Dile iy ) 
k=2 


while the upper bound on the number of basic operations using our -algori- 


thm was given by 


C = Si i 
k=] 


Since aoe is greater than (n-1)!/(n-k)!, we found that C was less than D. 
Moreover, since the basic operations required by Danielson's algorithm 
are more complex than those required by our algorithm, we see that our 
method represents a considerable improvement over the matrix technique. 
By introducing the function p defined in Chapter IV, we were able 
to modify our algorithm for application to coalesced graphs. Thus, we 
can analyze extrem:ly cumbersome graphs using the techniques developed, 
without being required to perform an undue number of basic operations. 
It is our opinion that we have accomplished the goal we set out to 
achieve. However, it is believed that the algorithm might be made even 
more efficient. It was noted after the proof of Theorem II that some 
simple cycles may be located more than once. We suggest that there may 


be a way of eliminating this ambiguity. If this could be done, then the 
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number of basic operations required for certain graphs would be cut down 
considerably. 

It was noted that Danielson's algorithm was designed for undirected 
graphs. Clearly, it is not very efficient. We suggest that a method 
similar to the one we have developed could be devised for application 


to undirected graphs which would be superior to Danielson's. 
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